Logic for Part-of-Speech Tagging and Shallow Parsing
Author
Abstract
In this paper, a purely logical approach to part-of-speech tagging and shallow parsing is explored. It has a lot in common with reductionist parsing strategies such as those employed in Constraint Grammar (Karlsson et al. 1994) and Finite-State Intersection Grammar (Koskenniemi 1990), but rules are formulated entirely in logic, and a model generation theorem prover is used for part-of-speech tagging and parsing.
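The reductionist idea in the abstract can be illustrated with a toy sketch: start from every tag a lexicon allows for each word, then keep only the tag sequences that satisfy all rules. This is not the paper's theorem prover or rule set. The lexicon, the tag names, the two constraints, and the function `models` are hypothetical, and brute-force enumeration stands in for real model generation.

```python
from itertools import product

# Hypothetical toy lexicon: each word maps to the tags it may carry.
LEXICON = {
    "the": {"DET"},
    "can": {"NOUN", "VERB", "AUX"},
    "rusts": {"NOUN", "VERB"},
}


def no_verb_after_det(tags):
    """Hypothetical rule: a determiner is never directly followed by a verb or auxiliary."""
    return all(
        not (left == "DET" and right in {"VERB", "AUX"})
        for left, right in zip(tags, tags[1:])
    )


def exactly_one_verb(tags):
    """Hypothetical rule: the sentence contains exactly one main verb."""
    return sum(tag == "VERB" for tag in tags) == 1


CONSTRAINTS = [no_verb_after_det, exactly_one_verb]


def models(words, lexicon=LEXICON, constraints=CONSTRAINTS):
    """Return every tag sequence allowed by the lexicon that satisfies all constraints.

    Each surviving sequence plays the role of a model of the rules;
    sequences that violate any rule are discarded.
    """
    candidates = [sorted(lexicon[word]) for word in words]
    return [
        tags
        for tags in product(*candidates)
        if all(rule(tags) for rule in constraints)
    ]


if __name__ == "__main__":
    print(models(["the", "can", "rusts"]))
```

For the input `["the", "can", "rusts"]`, six tag sequences are possible under the lexicon and only one survives both rules. An input with no surviving sequence yields an empty list, which corresponds to the rules being unsatisfiable for that sentence.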
Similar resources
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a form of syntactic parsing that automatically analyzes the dependency structure of sentences, producing a dependency graph for each input sentence. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
A comparative study of the effect of part-of-speech tagging on parsing in automatic processing of the Persian language
In this paper, the role of Part-of-Speech (POS) tagging for parsing in automatic processing of the Persian language is studied. To this end, the impact of the quality of POS tagging as well as the impact of the quantity of information available in the POS tags on parsing are studied. To reach the goals, three parsing scenarios are proposed and compared. In the first scenario, the parser assigns...
Shallow Parsing as Part-of-Speech Tagging
Treating shallow parsing as part-of-speech tagging yields results comparable with other, more elaborate approaches. Using the CoNLL 2000 training and testing material, our best model had an accuracy of 94.88%, with an overall FB1 score of 91.94%. The individual FB1 scores for NPs were 92.19%, VPs 92.70% and PPs 96.69%.
Part of Speech Tagging and Shallow Parsing of Indian Languages
This paper describes and evaluates shallow parsing of several Indian languages utilizing Conditional Random Field models. We show how performance can be substantially improved by several feature enhancements and improved modeling techniques, including expanding the chunk tag inventory, and separating punctuation from linguistic phrases. We also report results from part of speech tagging of Hind...
Linguistic-prosodic processing for text-to-speech synthesis in Italian
The linguistic-prosodic processing applied to text-to-speech synthesis in Italian is described. It proceeds in 5 steps: tokenisation and normalisation of abbreviations, numbers, etc.; part-of-speech tagging, based on function words, terminations and contextual heuristics; shallow parsing, based on a chunk grammar; grapheme-to-phoneme conversion, lexical stress assignment and syllabification by ...
Robust Tagging System for Lexicon Creation
This paper presents a robust rule-based system of shallow parsing for part-of-speech (PoS) recognition and tagging. Unlike previous work, the system uses parsing for tagging, based on unsupervised learning methods with no prior knowledge, training, or pre-tagged corpora. START (System of Textual Analysis Recognition and Tagging) has been evaluated on both French and Greek non-annotated corpora,...